Modification of pitch using DCT in the source domain

نویسندگان

  • Rangarao Muralishankar
  • A. G. Ramakrishnan
  • P. Prathibha
چکیده

In this paper, we propose a novel algorithm for pitch modification. The linear prediction residual is obtained from pitch synchronous frames by inverse filtering the speech signal. Then the Discrete Cosine Transform (DCT) of these residual frames is taken. Based on the desired factor of pitch modification, the dimension of the DCT coefficients of the residual is modified by truncating or zero padding, and then the Inverse Discrete Cosine Transform is obtained. This period modified residual signal is then forward filtered to obtain the pitch modified speech. The mismatch in the positions of the harmonics between the pitch modified signal and the LP spectrum introduce gain variations, which is more pronounced in the case of female speech [16]. This is minimised by modifying the radii of the poles of the filter to smoothen the peaky linear predictive spectrum before forward filtering. This pitch modification scheme is used in our Concatenative Speech synthesis system for Kannada. The technique has also been successfully applied to creating interrogative sentences from affirmative sentences. KeywordsLinear Prediction, Concatenative synthesis, residual signal, resampling, 3dB bandwidth.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Multi-Focus Image Fusion in DCT Domain using Variance and Energy of Laplacian and Correlation Coefficient for Visual Sensor Networks

The purpose of multi-focus image fusion is gathering the essential information and the focused parts from the input multi-focus images into a single image. These multi-focus images are captured with different depths of focus of cameras. A lot of multi-focus image fusion techniques have been introduced using considering the focus measurement in the spatial domain. However, the multi-focus image ...

متن کامل

Dct Based Pitch Modification

In this paper, we propose a novel algorithm for pitch modification. The linear prediction residual is obtained from pitch synchronous frames by inverse filtering the speech signal. Then Discrete Cosine Transform (DCT) is applied on these pitch synchronous frames. Based on the desired factor of pitch modification, the dimension of the DCT vector is changed by truncation or zero padding, and then...

متن کامل

Non-linear Pitch Modification in Voice Conversion Using Artificial Neural Networks

Majority of the current voice conversion methods do not focus on the modelling local variations of pitch contour, but only on linear modification of the pitch values, based on means and standard deviations. However, a significant amount of speaker related information is also present in pitch contour. In this paper we propose a non-linear pitch modification method for mapping the pitch contours ...

متن کامل

A mixed-excitation frequency domain model for time-scale pitch-scale modification of speech

This paper presents a time-scale pitch-scale modification technique for concatenative speech synthesis. The method is based on a frequency domain source-filter model, where the source is modeled as a mixed excitation. This model is highly coupled with a compression scheme that result in compact acoustic inventories. When compared to the approach in the Whistler system using no mixed excitation,...

متن کامل

A Pitch-Catch Based Online Structural Health Monitoring of Pressure Vessels, Considering Corrosion Formation

Structural health monitoring is a developing research field which is multifunctional and can estimate the health condition of the structure by data analyzing and also can prognosticate the structural damages. Illuminating the damages by using piezoelectric sensors is one of the most effective techniques in structural health monitoring. Pressurized equipments are very important components in pro...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Speech Communication

دوره 42  شماره 

صفحات  -

تاریخ انتشار 2004